Subgroup discovery
نویسنده
چکیده
The discovery of (interesting) subgroups has a high practical relevance in all domains of science or business. For example, consider statements such as: ”the unemployment rate is above average for young men with a low educational level”, ”smokers with a positive family history are at a significantly higher risk for coronary heart disease”, or ”single males living in rural areas do rarely take out a life policy”. Subgroup discovery is well suited for finding such dependencies, i.e., discovering relations between a dependent and (several) independent variables, for inductive and explorative data analysis tasks. Then, the discovered subgroups can be applied by the analyst, e.g., for decision support. This article aims to give a first idea of subgroup discovery. We introduce the subgroup discovery task and discuss the setting and the issues concerning the search process. Then, we give an outlook on further research directions.
منابع مشابه
Subgroup Discovery – Advanced Review
Subgroup discovery is a broadly applicable descriptive data mining technique for identifying interesting subgroups according to some property of interest. This article summarizes fundamentals of subgroup discovery, before it reviews algorithms and further advanced methodological issues. In addition, we briefly discuss tools and applications of subgroup discovery approaches. In that context, we ...
متن کاملUsing Subgroup Discovery Metrics to Mine Interesting Subgraphs
While extensive work has been done in both graph mining and subgroup discovery, the potential benefits of combining the two fields have not been well studied. We propose, implement, and evaluate an adaption of an existing subgroup discovery algorithm to mine graph data. Our experiments use two different metrics from the subgroup discovery literature to demonstrate value in using such metrics to...
متن کاملAPRIORI-SD: Adapting Association Rule Learning to Subgroup Discovery
& This paper presents a subgroup discovery algorithm APRIORI-SD, developed by adapting association rule learning to subgroup discovery. The paper contributes to subgroup discovery, to a better understanding of the weighted covering algorithm, and the properties of the weighted relative accuracy heuristic by analyzing their performance in the ROC space. An experimental comparison with rule learn...
متن کاملRSD: Relational Subgroup Discovery through First-Order Feature Construction
Relational rule learning is typically used in solving classification and prediction tasks. However, relational rule learning can be adapted also to subgroup discovery. This paper proposes a propositionalization approach to relational subgroup discovery, achieved through appropriately adapting rule learning and first-order feature construction. The proposed approach, applicable to subgroup disco...
متن کاملExpert-Guided Subgroup Discovery: Methodology and Application
This paper presents an approach to expert-guided subgroup discovery. The main step of the subgroup discovery process, the induction of subgroup descriptions, is performed by a heuristic beam search algorithm, using a novel parametrized definition of rule quality which is analyzed in detail. The other important steps of the proposed subgroup discovery process are the detection of statistically s...
متن کاملTowards Knowledge-Intensive Subgroup Discovery
Subgroup discovery can be applied for exploration or descriptive induction in order to discover ”interesting” subgroups of the general population, given a certain property of interest. In domains with available background knowledge, the user usually wants to utilize this to improve the quality of the subgroup discovery results. We describe a knowledge-intensive approach for subgroup discovery u...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Wiley Interdisc. Rew.: Data Mining and Knowledge Discovery
دوره 5 شماره
صفحات -
تاریخ انتشار 2015